Skip to main content

Requirements Gathering Summary Report

Projects 1, 2, 3, 4, 5

Week 1, T2, 2024

1. Data Sources and Formats

ProjectCurrent Data SourcesFuture Data SourcesData Formats
Project 1VR sensors-JSON (assumed)
Project 2External data IoT devices via ThingSpeakTo be providedTo be provided
Project 3.fit files (Garmin, Strava)-CSV
Project 4OpenCV, supervisionQuantitative data from video/camera streamNot specified
Project 5User sessions, temporary dataUser logins, projects, responses, rewards systemJSON

2. Data Volume, Frequency, and Performance

ProjectData VolumeFrequencyPerformance
Project 1A couple megabytesEvery 10 secondsNo latency, speed, scalability requirement
Project 25-10GBReal-timeNot specified
Project 350MB-Not specified
Project 4Normal to mediumReal-timeNot specified
Project 5SmallReal-timeReal-time access

3. Data Quality and Cleanup

ProjectData QualityCleanup Required
Project 1--
Project 2To be providedTo be provided
Project 3Varies, mostly clean-
Project 4To be assessedTo be assessed
Project 5No missing values or cleanup needed-

4. Data Shape and Real-Time Needs

ProjectData ShapeReal-Time Needs
Project 1Semi-structured-
Project 2Structured or semi-structured (To be confirmed)Real-time (To be confirmed)
Project 3Structured (CSV)-
Project 4Real-timeReal-time
Project 5ResearchReal-time

5. Pain Points and Data Warehouse Need

ProjectPain PointsData Warehouse Need
Project 1-SQL server and API access
Project 2ThingSpeak limitationsNeeded for improvement
Project 3No current need for data warehouseGood practice but unnecessary
Project 4Need for pre-setup warehouseHelpful for faster setup
Project 5Lack of backend knowledgeExternal data solution needed

6. Data Warehouse Setup Ideas

ProjectSetup Ideas
Project 1SQL server, API access
Project 2Ideas for preprocessing
Project 3SQL database, API
Project 4Prefer MongoDB, API for UI streaming
Project 5External data solution, integration with EC2

Projects with Similar Requirements

Real-Time Data Needs

  • Project 2: Requires real-time data for tasks like fall detection (To be confirmed).
  • Project 4: Uses real-time streaming data from video/camera stream.
  • Project 5: Needs real-time access for its live service.

Data Warehouse Requirement

  • Project 1: SQL server setup and API access.
  • Project 2: Data warehouse needed to overcome ThingSpeak limitations.
  • Project 4: Data warehouse setup would help in faster implementation.
  • Project 5: External data solution with data warehouse team support.

Data Formats

  • Project 1: Assumed to use JSON format.
  • Project 5: Uses JSON format for data configuration.

Pain Points with Current Data Methods

  • Project 2: Limited functionalities with ThingSpeak.
  • Project 4: Pre-setup warehouse would speed up the process.
  • Project 5: Lack of backend knowledge in the team.

Ideas for Data Warehouse Setup

  • Project 1: SQL server and API.
  • Project 4: Prefers MongoDB with API for UI streaming.
  • Project 5: External data solution integrated with EC2.